Skip to main content

This item appears to not have any files that we can let you "experience" (like watching a video or viewing images) in this area.

We suggest you try the [DOWNLOAD OPTIONS] area to the right below to see if there are any files you would like to try to use or download.

Stack Exchange Data Dump


Published March 16, 2015


This is an anonymized dump of all user-contributed content on the Stack Exchange network. Each site is formatted as a separate archive consisting of XML files zipped via 7-zip using bzip2 compression. Each site archive includes Posts, Users, Votes, Comments, PostHistory and PostLinks. For complete schema information, see the included readme.txt.

We recommend downloading via bittorrent: https://archive.org/download/stackexchange/stackexchange_archive.torrent


All user content contributed to the Stack Exchange network is cc-by-sa 3.0 licensed, intended to be shared and remixed. We even provide all our data as a convenient data dump.
License: http://creativecommons.org/licenses/by-sa/3.0/
But our cc-by-sa 3.0 licensing, while intentionally permissive, does require attribution:
Attribution — You must attribute the work in the manner specified by the author or licensor (but not in any way that suggests that they endorse you or your use of the work).
Specifically the attribution requirements are as follows:
  1. Visually display or otherwise indicate the source of the content as coming from the Stack Exchange Network. This requirement is satisfied with a discreet text blurb, or some other unobtrusive but clear visual indication.

  2. Ensure that any Internet use of the content includes a hyperlink directly to the original question on the source site on the Network (e.g., http://stackoverflow.com/questions/12345)

  3. Visually display or otherwise clearly indicate the author names for every question and answer used

  4. Ensure that any Internet use of the content includes a hyperlink for each author name directly back to his or her user profile page on the source site on the Network (e.g., http://stackoverflow.com/users/12345/username), directly to the Stack Exchange domain, in standard HTML (i.e. not through a Tinyurl or other such indirect hyperlink, form of obfuscation or redirection), without any “nofollow” command or any other such means of avoiding detection by search engines, and visible even with JavaScript disabled.

For more information, see the Stack Exchange Terms of Service.


Identifier stackexchange
Publicdate 2014-01-21 18:54:32
Mediatype data
Addeddate 2014-01-21 18:54:32
Creator Stack Exchange, Inc.
Date 2015-03-16
Year 2015
Year 2014
Contributor Stack Exchange Community
Licenseurl http://creativecommons.org/licenses/by-sa/3.0/

Reviews

Reviewer: sathvik - - May 22, 2015
Subject: Thanks for sharing
Thanks for sharing the community data. It will greatly benefit research groups.
Reviewer: dmpetrov - - May 3, 2015
Subject: April data
Great data set. Thank you for sharing.

I see only March data. How can I get April data?
What about January and February?

Thanks,
Dmitry
Reviewer: alisa1 - - April 10, 2015
Subject: Resolved!
I also tried couple of times. It was failed at the same point.
But then I tried when I logged in, and I was able to download the whole file :-)
Reviewer: big_t_dub - - April 3, 2015
Subject: 70%
stuck at 70.7% download complete via utorrent- arg!

this should be made avail via ftp!!!
Reviewer: Ihor Bobak - - March 27, 2015
Subject: File is broken
At the top right corner of this page there is a link to zip archive. I've downloaded it twice (on different machines, in different countries). The file was always broken.

Torrent stucks on 70.8%.

Can anyone help to get this file?
Reviewer: gnijuohz - - March 25, 2015
Subject: No seed?
It stopped at around 70%.
Reviewer: klitzkrieg - - March 24, 2015
Subject: Seeds for 3/16/15 version?
Everybody's stuck at 70.7%
Reviewer: Nemo_bis - - February 7, 2015
Subject: Thanks and tests
Thanks for the September update, eager to see the next one. Did someone try importing this data into a StackExchange instance?

Fun to see how small the whole SE network is after all, only few GB compressed. Wikimedia projects dumps compress very well too, but they're still much bigger (while fitting a common hard disk anyway!).
Reviewer: shamsazad - - August 19, 2014
Subject: Latest Dump.
When will be latest dump from stackoverflow will be posted over here.
Reviewer: Jenson555 - - July 26, 2014
Subject: Really Cool
This is an Awesome Stuff..Cheers..:)
DOWNLOAD OPTIONS
7Z
BACK
In Collection
Community Texts
Uploaded by
Stack Exchange
on 1/21/2014
Views
24,110
Favorites
4
Reviews
10
PEOPLE ALSO FOUND
Community Texts
27
0
0
Community Texts
by Stack Exchange, Inc.
23,867
4
10
( 10 reviews )
Community Texts
by Jolyon C. Parish
5
0
0
Community Texts
by Luiz Roberto Fontes
51
0
0
Community Texts
by Spirit of Revolt.
2
0
0
Source: http%3A%2F%2Fwalkingpapers.org%2F
Community Texts
2
0
0
Community Texts
1
0
0